Dataset statistics
| Number of variables | 11 |
|---|---|
| Number of observations | 18562 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 1.5 MiB |
| Average record size in memory | 82.0 B |
Variable types
| Numeric | 9 |
|---|---|
| Boolean | 2 |
MedInc is highly overall correlated with AveRooms and 1 other fields | High correlation |
AveRooms is highly overall correlated with MedInc | High correlation |
Latitude is highly overall correlated with Longitude and 2 other fields | High correlation |
Longitude is highly overall correlated with Latitude and 2 other fields | High correlation |
MedHouseVal is highly overall correlated with MedInc | High correlation |
west_of_lon120 is highly overall correlated with Latitude and 2 other fields | High correlation |
north_of_lat36 is highly overall correlated with Latitude and 2 other fields | High correlation |
Reproduction
| Analysis started | 2023-05-26 17:38:25.158765 |
|---|---|
| Analysis finished | 2023-05-26 17:38:35.894845 |
| Duration | 10.74 seconds |
| Software version | ydata-profiling vv4.1.2 |
| Download configuration | config.json |
MedInc
Real number (ℝ)
| Distinct | 11684 |
|---|---|
| Distinct (%) | 62.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.6745204 |
| Minimum | 0.4999 |
|---|---|
| Maximum | 9.9055 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 290.0 KiB |
Quantile statistics
| Minimum | 0.4999 |
|---|---|
| 5-th percentile | 1.60003 |
| Q1 | 2.539275 |
| median | 3.4643 |
| Q3 | 4.5893 |
| 95-th percentile | 6.526765 |
| Maximum | 9.9055 |
| Range | 9.4056 |
| Interquartile range (IQR) | 2.050025 |
Descriptive statistics
| Standard deviation | 1.5241343 |
|---|---|
| Coefficient of variation (CV) | 0.41478455 |
| Kurtosis | 0.35319227 |
| Mean | 3.6745204 |
| Median Absolute Deviation (MAD) | 1.006 |
| Skewness | 0.72499648 |
| Sum | 68206.447 |
| Variance | 2.3229853 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 2.875 | 42 | 0.2% |
| 3.125 | 42 | 0.2% |
| 3.875 | 36 | 0.2% |
| 4.125 | 36 | 0.2% |
| 3.625 | 35 | 0.2% |
| 3 | 35 | 0.2% |
| 2.625 | 35 | 0.2% |
| 4.375 | 34 | 0.2% |
| 4 | 34 | 0.2% |
| 3.375 | 32 | 0.2% |
| Other values (11674) | 18201 |
| Value | Count | Frequency (%) |
| 0.4999 | 2 | |
| 0.536 | 4 | |
| 0.5495 | 1 | < 0.1% |
| 0.6433 | 1 | < 0.1% |
| 0.6775 | 1 | < 0.1% |
| 0.6825 | 1 | < 0.1% |
| 0.6831 | 1 | < 0.1% |
| 0.6991 | 1 | < 0.1% |
| 0.7007 | 1 | < 0.1% |
| 0.7054 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 9.9055 | 1 | |
| 9.8346 | 1 | |
| 9.8074 | 1 | |
| 9.7037 | 1 | |
| 9.6986 | 1 | |
| 9.6062 | 1 | |
| 9.6023 | 1 | |
| 9.5908 | 1 | |
| 9.5862 | 1 | |
| 9.5823 | 1 |
HouseAge
Real number (ℝ)
| Distinct | 52 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 28.889613 |
| Minimum | 1 |
|---|---|
| Maximum | 52 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 290.0 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 9 |
| Q1 | 19 |
| median | 29 |
| Q3 | 37 |
| 95-th percentile | 52 |
| Maximum | 52 |
| Range | 51 |
| Interquartile range (IQR) | 18 |
Descriptive statistics
| Standard deviation | 12.348483 |
|---|---|
| Coefficient of variation (CV) | 0.42743678 |
| Kurtosis | -0.78539152 |
| Mean | 28.889613 |
| Median Absolute Deviation (MAD) | 9 |
| Skewness | 0.040759735 |
| Sum | 536249 |
| Variance | 152.48504 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 52 | 1059 | 5.7% |
| 36 | 803 | 4.3% |
| 35 | 774 | 4.2% |
| 16 | 692 | 3.7% |
| 34 | 633 | 3.4% |
| 17 | 612 | 3.3% |
| 26 | 577 | 3.1% |
| 33 | 573 | 3.1% |
| 32 | 519 | 2.8% |
| 25 | 511 | 2.8% |
| Other values (42) | 11809 |
| Value | Count | Frequency (%) |
| 1 | 3 | < 0.1% |
| 2 | 33 | 0.2% |
| 3 | 45 | 0.2% |
| 4 | 142 | |
| 5 | 199 | |
| 6 | 140 | |
| 7 | 138 | |
| 8 | 167 | |
| 9 | 178 | |
| 10 | 227 |
| Value | Count | Frequency (%) |
| 52 | 1059 | |
| 51 | 43 | 0.2% |
| 50 | 119 | 0.6% |
| 49 | 125 | 0.7% |
| 48 | 163 | 0.9% |
| 47 | 190 | 1.0% |
| 46 | 228 | 1.2% |
| 45 | 274 | 1.5% |
| 44 | 331 | 1.8% |
| 43 | 334 | 1.8% |
AveRooms
Real number (ℝ)
| Distinct | 17476 |
|---|---|
| Distinct (%) | 94.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5.1644141 |
| Minimum | 0.84615385 |
|---|---|
| Maximum | 9.8307692 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 290.0 KiB |
Quantile statistics
| Minimum | 0.84615385 |
|---|---|
| 5-th percentile | 3.4317002 |
| Q1 | 4.3965104 |
| median | 5.1412284 |
| Q3 | 5.8840728 |
| 95-th percentile | 7.0609231 |
| Maximum | 9.8307692 |
| Range | 8.9846154 |
| Interquartile range (IQR) | 1.4875624 |
Descriptive statistics
| Standard deviation | 1.1102088 |
|---|---|
| Coefficient of variation (CV) | 0.21497284 |
| Kurtosis | 0.15960843 |
| Mean | 5.1644141 |
| Median Absolute Deviation (MAD) | 0.74402219 |
| Skewness | 0.1533985 |
| Sum | 95861.854 |
| Variance | 1.2325635 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 5 | 27 | 0.1% |
| 4.5 | 20 | 0.1% |
| 6 | 18 | 0.1% |
| 4 | 18 | 0.1% |
| 5.333333333 | 12 | 0.1% |
| 5.5 | 11 | 0.1% |
| 4.666666667 | 8 | < 0.1% |
| 5.666666667 | 7 | < 0.1% |
| 6.090909091 | 7 | < 0.1% |
| 7 | 6 | < 0.1% |
| Other values (17466) | 18428 |
| Value | Count | Frequency (%) |
| 0.8461538462 | 1 | |
| 1 | 1 | |
| 1.130434783 | 1 | |
| 1.260869565 | 1 | |
| 1.378486056 | 1 | |
| 1.411290323 | 1 | |
| 1.465753425 | 1 | |
| 1.550408719 | 1 | |
| 1.553030303 | 1 | |
| 1.598130841 | 1 |
| Value | Count | Frequency (%) |
| 9.830769231 | 1 | |
| 9.691891892 | 1 | |
| 9.505154639 | 1 | |
| 9.466666667 | 1 | |
| 9.431884058 | 1 | |
| 9.286821705 | 1 | |
| 9.263598326 | 1 | |
| 9.232253086 | 1 | |
| 9.142857143 | 1 | |
| 9 | 1 |
AveBedrms
Real number (ℝ)
| Distinct | 12837 |
|---|---|
| Distinct (%) | 69.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.0537599 |
| Minimum | 0.8 |
|---|---|
| Maximum | 1.4 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 290.0 KiB |
Quantile statistics
| Minimum | 0.8 |
|---|---|
| 5-th percentile | 0.94002864 |
| Q1 | 1.0041852 |
| median | 1.0459409 |
| Q3 | 1.0931911 |
| 95-th percentile | 1.1965374 |
| Maximum | 1.4 |
| Range | 0.6 |
| Interquartile range (IQR) | 0.089005917 |
Descriptive statistics
| Standard deviation | 0.079690275 |
|---|---|
| Coefficient of variation (CV) | 0.075624698 |
| Kurtosis | 1.9641624 |
| Mean | 1.0537599 |
| Median Absolute Deviation (MAD) | 0.044139102 |
| Skewness | 0.82841005 |
| Sum | 19559.892 |
| Variance | 0.00635054 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 1 | 265 | 1.4% |
| 1.058823529 | 25 | 0.1% |
| 1.125 | 24 | 0.1% |
| 1.052631579 | 23 | 0.1% |
| 1.083333333 | 22 | 0.1% |
| 1.1 | 22 | 0.1% |
| 1.05 | 21 | 0.1% |
| 1.090909091 | 20 | 0.1% |
| 1.055555556 | 18 | 0.1% |
| 1.076923077 | 17 | 0.1% |
| Other values (12827) | 18105 |
| Value | Count | Frequency (%) |
| 0.8 | 7 | |
| 0.8058823529 | 1 | < 0.1% |
| 0.8064516129 | 1 | < 0.1% |
| 0.8085106383 | 1 | < 0.1% |
| 0.8098159509 | 1 | < 0.1% |
| 0.8125 | 1 | < 0.1% |
| 0.8130841121 | 1 | < 0.1% |
| 0.8134920635 | 1 | < 0.1% |
| 0.813559322 | 1 | < 0.1% |
| 0.8155339806 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 1.4 | 4 | |
| 1.399558499 | 1 | < 0.1% |
| 1.397959184 | 1 | < 0.1% |
| 1.397590361 | 1 | < 0.1% |
| 1.396959459 | 1 | < 0.1% |
| 1.396825397 | 1 | < 0.1% |
| 1.396666667 | 1 | < 0.1% |
| 1.394736842 | 1 | < 0.1% |
| 1.394230769 | 1 | < 0.1% |
| 1.393767705 | 1 | < 0.1% |
Population
Real number (ℝ)
| Distinct | 3519 |
|---|---|
| Distinct (%) | 19.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1377.4422 |
| Minimum | 5 |
|---|---|
| Maximum | 4992 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 290.0 KiB |
Quantile statistics
| Minimum | 5 |
|---|---|
| 5-th percentile | 403 |
| Q1 | 816 |
| median | 1188 |
| Q3 | 1726 |
| 95-th percentile | 3058.9 |
| Maximum | 4992 |
| Range | 4987 |
| Interquartile range (IQR) | 910 |
Descriptive statistics
| Standard deviation | 823.40117 |
|---|---|
| Coefficient of variation (CV) | 0.59777548 |
| Kurtosis | 2.4498016 |
| Mean | 1377.4422 |
| Median Absolute Deviation (MAD) | 429 |
| Skewness | 1.4058855 |
| Sum | 25568082 |
| Variance | 677989.48 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 1227 | 24 | 0.1% |
| 1052 | 24 | 0.1% |
| 891 | 24 | 0.1% |
| 782 | 22 | 0.1% |
| 1098 | 21 | 0.1% |
| 781 | 21 | 0.1% |
| 872 | 21 | 0.1% |
| 850 | 21 | 0.1% |
| 1005 | 21 | 0.1% |
| 1047 | 20 | 0.1% |
| Other values (3509) | 18343 |
| Value | Count | Frequency (%) |
| 5 | 1 | < 0.1% |
| 6 | 1 | < 0.1% |
| 8 | 3 | |
| 9 | 1 | < 0.1% |
| 11 | 1 | < 0.1% |
| 13 | 2 | |
| 14 | 1 | < 0.1% |
| 15 | 2 | |
| 19 | 1 | < 0.1% |
| 22 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 4992 | 1 | |
| 4985 | 1 | |
| 4983 | 1 | |
| 4976 | 1 | |
| 4970 | 1 | |
| 4956 | 1 | |
| 4952 | 1 | |
| 4951 | 1 | |
| 4945 | 1 | |
| 4944 | 1 |
AveOccup
Real number (ℝ)
| Distinct | 17118 |
|---|---|
| Distinct (%) | 92.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.9314308 |
| Minimum | 0.97058824 |
|---|---|
| Maximum | 6 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 290.0 KiB |
Quantile statistics
| Minimum | 0.97058824 |
|---|---|
| 5-th percentile | 1.9019506 |
| Q1 | 2.4528869 |
| median | 2.8425133 |
| Q3 | 3.3034155 |
| 95-th percentile | 4.2999635 |
| Maximum | 6 |
| Range | 5.0294118 |
| Interquartile range (IQR) | 0.85052856 |
Descriptive statistics
| Standard deviation | 0.71674177 |
|---|---|
| Coefficient of variation (CV) | 0.24450236 |
| Kurtosis | 1.021369 |
| Mean | 2.9314308 |
| Median Absolute Deviation (MAD) | 0.41973565 |
| Skewness | 0.7840321 |
| Sum | 54413.219 |
| Variance | 0.51371877 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 3 | 28 | 0.2% |
| 2 | 14 | 0.1% |
| 2.666666667 | 12 | 0.1% |
| 2.5 | 12 | 0.1% |
| 3.2 | 11 | 0.1% |
| 2.555555556 | 9 | < 0.1% |
| 2.8 | 9 | < 0.1% |
| 2.6 | 9 | < 0.1% |
| 2.333333333 | 8 | < 0.1% |
| 4 | 8 | < 0.1% |
| Other values (17108) | 18442 |
| Value | Count | Frequency (%) |
| 0.9705882353 | 1 | |
| 1.066176471 | 1 | |
| 1.089267803 | 1 | |
| 1.161290323 | 1 | |
| 1.169329073 | 1 | |
| 1.215873016 | 1 | |
| 1.239616613 | 1 | |
| 1.244556114 | 1 | |
| 1.24516129 | 1 | |
| 1.263565891 | 1 |
| Value | Count | Frequency (%) |
| 6 | 1 | |
| 5.995680346 | 1 | |
| 5.96941896 | 1 | |
| 5.961077844 | 1 | |
| 5.950248756 | 1 | |
| 5.941176471 | 1 | |
| 5.940340909 | 1 | |
| 5.933554817 | 1 | |
| 5.927066451 | 1 | |
| 5.923076923 | 1 |
Latitude
Real number (ℝ)
| Distinct | 839 |
|---|---|
| Distinct (%) | 4.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 35.630331 |
| Minimum | 32.54 |
|---|---|
| Maximum | 41.95 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 290.0 KiB |
Quantile statistics
| Minimum | 32.54 |
|---|---|
| 5-th percentile | 32.81 |
| Q1 | 33.93 |
| median | 34.26 |
| Q3 | 37.72 |
| 95-th percentile | 38.9 |
| Maximum | 41.95 |
| Range | 9.41 |
| Interquartile range (IQR) | 3.79 |
Descriptive statistics
| Standard deviation | 2.1308925 |
|---|---|
| Coefficient of variation (CV) | 0.059805577 |
| Kurtosis | -1.1255001 |
| Mean | 35.630331 |
| Median Absolute Deviation (MAD) | 1.28 |
| Skewness | 0.45389167 |
| Sum | 661370.21 |
| Variance | 4.5407029 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 34.08 | 215 | 1.2% |
| 34.05 | 203 | 1.1% |
| 34.09 | 200 | 1.1% |
| 34.02 | 190 | 1.0% |
| 34.07 | 189 | 1.0% |
| 34.06 | 188 | 1.0% |
| 34.04 | 188 | 1.0% |
| 34.1 | 184 | 1.0% |
| 34.03 | 179 | 1.0% |
| 33.93 | 174 | 0.9% |
| Other values (829) | 16652 |
| Value | Count | Frequency (%) |
| 32.54 | 1 | < 0.1% |
| 32.55 | 3 | < 0.1% |
| 32.56 | 9 | < 0.1% |
| 32.57 | 18 | |
| 32.58 | 26 | |
| 32.59 | 11 | |
| 32.6 | 8 | < 0.1% |
| 32.61 | 14 | |
| 32.62 | 13 | |
| 32.63 | 16 |
| Value | Count | Frequency (%) |
| 41.95 | 2 | |
| 41.92 | 1 | < 0.1% |
| 41.88 | 1 | < 0.1% |
| 41.86 | 3 | |
| 41.84 | 1 | < 0.1% |
| 41.81 | 1 | < 0.1% |
| 41.8 | 3 | |
| 41.78 | 3 | |
| 41.77 | 1 | < 0.1% |
| 41.76 | 2 |
Longitude
Real number (ℝ)
| Distinct | 789 |
|---|---|
| Distinct (%) | 4.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -119.58285 |
| Minimum | -124.35 |
|---|---|
| Maximum | -114.55 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 18562 |
| Negative (%) | 100.0% |
| Memory size | 290.0 KiB |
Quantile statistics
| Minimum | -124.35 |
|---|---|
| 5-th percentile | -122.46 |
| Q1 | -121.79 |
| median | -118.495 |
| Q3 | -118.01 |
| 95-th percentile | -117.09 |
| Maximum | -114.55 |
| Range | 9.8 |
| Interquartile range (IQR) | 3.78 |
Descriptive statistics
| Standard deviation | 1.9927056 |
|---|---|
| Coefficient of variation (CV) | -0.016663808 |
| Kurtosis | -1.3509814 |
| Mean | -119.58285 |
| Median Absolute Deviation (MAD) | 1.285 |
| Skewness | -0.30175907 |
| Sum | -2219696.9 |
| Variance | 3.9708758 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| -118.3 | 155 | 0.8% |
| -118.31 | 155 | 0.8% |
| -118.29 | 142 | 0.8% |
| -118.27 | 142 | 0.8% |
| -118.28 | 135 | 0.7% |
| -118.19 | 129 | 0.7% |
| -118.35 | 129 | 0.7% |
| -118.36 | 128 | 0.7% |
| -118.32 | 128 | 0.7% |
| -118.25 | 125 | 0.7% |
| Other values (779) | 17194 |
| Value | Count | Frequency (%) |
| -124.35 | 1 | < 0.1% |
| -124.3 | 2 | < 0.1% |
| -124.27 | 1 | < 0.1% |
| -124.26 | 1 | < 0.1% |
| -124.23 | 3 | < 0.1% |
| -124.22 | 1 | < 0.1% |
| -124.21 | 3 | < 0.1% |
| -124.19 | 4 | < 0.1% |
| -124.18 | 6 | |
| -124.17 | 12 |
| Value | Count | Frequency (%) |
| -114.55 | 1 | < 0.1% |
| -114.57 | 2 | |
| -114.58 | 2 | |
| -114.59 | 2 | |
| -114.6 | 3 | |
| -114.61 | 3 | |
| -114.62 | 1 | < 0.1% |
| -114.63 | 1 | < 0.1% |
| -114.64 | 1 | < 0.1% |
| -114.65 | 2 |
MedHouseVal
Real number (ℝ)
| Distinct | 3799 |
|---|---|
| Distinct (%) | 20.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.9372673 |
| Minimum | 0.14999 |
|---|---|
| Maximum | 5 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 290.0 KiB |
Quantile statistics
| Minimum | 0.14999 |
|---|---|
| 5-th percentile | 0.66 |
| Q1 | 1.18 |
| median | 1.754 |
| Q3 | 2.5 |
| 95-th percentile | 3.84795 |
| Maximum | 5 |
| Range | 4.85001 |
| Interquartile range (IQR) | 1.32 |
Descriptive statistics
| Standard deviation | 0.97362999 |
|---|---|
| Coefficient of variation (CV) | 0.50257908 |
| Kurtosis | 0.1196008 |
| Mean | 1.9372673 |
| Median Absolute Deviation (MAD) | 0.64 |
| Skewness | 0.78594188 |
| Sum | 35959.555 |
| Variance | 0.94795536 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 1.375 | 103 | 0.6% |
| 1.625 | 99 | 0.5% |
| 1.125 | 85 | 0.5% |
| 1.875 | 85 | 0.5% |
| 2.25 | 83 | 0.4% |
| 3.5 | 72 | 0.4% |
| 0.875 | 65 | 0.4% |
| 1.5 | 60 | 0.3% |
| 2.75 | 59 | 0.3% |
| 1.75 | 57 | 0.3% |
| Other values (3789) | 17794 |
| Value | Count | Frequency (%) |
| 0.14999 | 1 | < 0.1% |
| 0.175 | 1 | < 0.1% |
| 0.225 | 2 | |
| 0.25 | 1 | < 0.1% |
| 0.266 | 1 | < 0.1% |
| 0.269 | 1 | < 0.1% |
| 0.3 | 1 | < 0.1% |
| 0.325 | 3 | |
| 0.332 | 1 | < 0.1% |
| 0.344 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 5 | 26 | |
| 4.991 | 1 | < 0.1% |
| 4.99 | 1 | < 0.1% |
| 4.988 | 1 | < 0.1% |
| 4.987 | 1 | < 0.1% |
| 4.984 | 1 | < 0.1% |
| 4.976 | 1 | < 0.1% |
| 4.974 | 1 | < 0.1% |
| 4.964 | 2 | < 0.1% |
| 4.96 | 1 | < 0.1% |
west_of_lon120
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 163.1 KiB |
| False | |
|---|---|
| True |
| Value | Count | Frequency (%) |
| False | 11150 | |
| True | 7412 |
north_of_lat36
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 163.1 KiB |
| False | |
|---|---|
| True |
| Value | Count | Frequency (%) |
| False | 10589 | |
| True | 7973 |
| MedInc | HouseAge | AveRooms | AveBedrms | Population | AveOccup | Latitude | Longitude | MedHouseVal | west_of_lon120 | north_of_lat36 | |
|---|---|---|---|---|---|---|---|---|---|---|---|
| MedInc | 1.000 | -0.183 | 0.671 | -0.269 | 0.007 | -0.041 | -0.076 | -0.003 | 0.652 | 0.029 | 0.034 |
| HouseAge | -0.183 | 1.000 | -0.228 | -0.087 | -0.293 | -0.016 | 0.031 | -0.138 | 0.049 | 0.164 | 0.160 |
| AveRooms | 0.671 | -0.228 | 1.000 | 0.004 | -0.075 | 0.044 | 0.136 | -0.056 | 0.249 | 0.156 | 0.171 |
| AveBedrms | -0.269 | -0.087 | 0.004 | 1.000 | 0.072 | -0.102 | 0.041 | -0.008 | -0.124 | 0.037 | 0.033 |
| Population | 0.007 | -0.293 | -0.075 | 0.072 | 1.000 | 0.234 | -0.117 | 0.122 | 0.014 | 0.115 | 0.124 |
| AveOccup | -0.041 | -0.016 | 0.044 | -0.102 | 0.234 | 1.000 | -0.169 | 0.196 | -0.256 | 0.223 | 0.201 |
| Latitude | -0.076 | 0.031 | 0.136 | 0.041 | -0.117 | -0.169 | 1.000 | -0.885 | -0.158 | 0.936 | 0.988 |
| Longitude | -0.003 | -0.138 | -0.056 | -0.008 | 0.122 | 0.196 | -0.885 | 1.000 | -0.066 | 0.981 | 0.936 |
| MedHouseVal | 0.652 | 0.049 | 0.249 | -0.124 | 0.014 | -0.256 | -0.158 | -0.066 | 1.000 | 0.096 | 0.216 |
| west_of_lon120 | 0.029 | 0.164 | 0.156 | 0.037 | 0.115 | 0.223 | 0.936 | 0.981 | 0.096 | 1.000 | 0.879 |
| north_of_lat36 | 0.034 | 0.160 | 0.171 | 0.033 | 0.124 | 0.201 | 0.988 | 0.936 | 0.216 | 0.879 | 1.000 |
A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
| MedInc | HouseAge | AveRooms | AveBedrms | Population | AveOccup | Latitude | Longitude | MedHouseVal | west_of_lon120 | north_of_lat36 | |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 8.3252 | 41.0 | 6.984127 | 1.023810 | 322.0 | 2.555556 | 37.88 | -122.23 | 4.526 | True | True |
| 1 | 8.3014 | 21.0 | 6.238137 | 0.971880 | 2401.0 | 2.109842 | 37.86 | -122.22 | 3.585 | True | True |
| 2 | 7.2574 | 52.0 | 8.288136 | 1.073446 | 496.0 | 2.802260 | 37.85 | -122.24 | 3.521 | True | True |
| 3 | 5.6431 | 52.0 | 5.817352 | 1.073059 | 558.0 | 2.547945 | 37.85 | -122.25 | 3.413 | True | True |
| 4 | 3.8462 | 52.0 | 6.281853 | 1.081081 | 565.0 | 2.181467 | 37.85 | -122.25 | 3.422 | True | True |
| 5 | 4.0368 | 52.0 | 4.761658 | 1.103627 | 413.0 | 2.139896 | 37.85 | -122.25 | 2.697 | True | True |
| 6 | 3.6591 | 52.0 | 4.931907 | 0.951362 | 1094.0 | 2.128405 | 37.84 | -122.25 | 2.992 | True | True |
| 7 | 3.1200 | 52.0 | 4.797527 | 1.061824 | 1157.0 | 1.788253 | 37.84 | -122.25 | 2.414 | True | True |
| 8 | 2.0804 | 42.0 | 4.294118 | 1.117647 | 1206.0 | 2.026891 | 37.84 | -122.26 | 2.267 | True | True |
| 9 | 3.6912 | 52.0 | 4.970588 | 0.990196 | 1551.0 | 2.172269 | 37.84 | -122.25 | 2.611 | True | True |
| MedInc | HouseAge | AveRooms | AveBedrms | Population | AveOccup | Latitude | Longitude | MedHouseVal | west_of_lon120 | north_of_lat36 | |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 20630 | 3.5673 | 11.0 | 5.932584 | 1.134831 | 1257.0 | 2.824719 | 39.29 | -121.32 | 1.120 | True | True |
| 20631 | 3.5179 | 15.0 | 6.145833 | 1.141204 | 1200.0 | 2.777778 | 39.33 | -121.40 | 1.072 | True | True |
| 20632 | 3.1250 | 15.0 | 6.023377 | 1.080519 | 1047.0 | 2.719481 | 39.26 | -121.45 | 1.156 | True | True |
| 20633 | 2.5495 | 27.0 | 5.445026 | 1.078534 | 1082.0 | 2.832461 | 39.19 | -121.53 | 0.983 | True | True |
| 20634 | 3.7125 | 28.0 | 6.779070 | 1.148256 | 1041.0 | 3.026163 | 39.27 | -121.56 | 1.168 | True | True |
| 20635 | 1.5603 | 25.0 | 5.045455 | 1.133333 | 845.0 | 2.560606 | 39.48 | -121.09 | 0.781 | True | True |
| 20636 | 2.5568 | 18.0 | 6.114035 | 1.315789 | 356.0 | 3.122807 | 39.49 | -121.21 | 0.771 | True | True |
| 20637 | 1.7000 | 17.0 | 5.205543 | 1.120092 | 1007.0 | 2.325635 | 39.43 | -121.22 | 0.923 | True | True |
| 20638 | 1.8672 | 18.0 | 5.329513 | 1.171920 | 741.0 | 2.123209 | 39.43 | -121.32 | 0.847 | True | True |
| 20639 | 2.3886 | 16.0 | 5.254717 | 1.162264 | 1387.0 | 2.616981 | 39.37 | -121.24 | 0.894 | True | True |